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Abstract 

Let $ be a uniformly distributed random fc-SAT formula witfi n variables and m clauses. We prove 
that the Walksat algorithm from II16II17I finds a satisfying assignment of $ in polynomial time w.h.p. 
if m/n < p ■ 2*/fc for a certain constant p > 0. This is an improvement by a factor of G(fc) over the 
best previous analysis of Walksat from (9). 
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1 Introduction 

Let k > 3 and n > 1 be integers, let r > be a real, and set m ~ \rn]. Let $ — #fc(n,m) be 
a propositional formula obtained by choosing a set of m clauses of length k over the variables V = 
{xi, . . . ,Xn} uniformly at random. For k, r fixed we say that ^> has some property V with high probability 
('w.h.p.') if lim„^oo P E V] = 1. If I E {xi,xi, . . . , a;„, a;„} is a literal, then we denote its underlying 
variable by |/|. Furthermore, we define sign(Z) = — 1 if Hs a negative literal, and sign(Z) = 1 if Hs 
positive. 

The interest in random fc-SAT stems largely from the experimental observation that for certain densities 
r the random formula # is a challenging algorithmic benchmark |[T4] |. However, analyzing algorithms on 
random formulas is notoriously difficult. Indeed, the current rigorous results for random fc-SAT mostly 
deal with algorithms that are extremely simple both to state and to analyze, or with algorithms that were 
specifically designed so as to allow for a rigorous analysis. More precisely, the present analysis techniques 
are essentially confined to simple algorithms that aim to construct a satisfying assignment by determining 
the value of one variable at a time for good, without any backtracking or reassigning variables at a later time. 
By contrast, most 'real-life' satisfiability algorithms actually rely substantially on reassigning variables. 

Maybe the simplest example of a natural algorithm that eludes the standard analysis techniques is 
Walksat |[T6]fr7l . In combination with various heuristic tweaks, Walksat is highly successful in prac- 
tical SAT-solving ifTSl . Starting from the all-true assignment, this well-known local search algorithm tries 
to find a satisfying assignment of its input fc-CNF formula $ = $i A • • • A $,„ as follows. If the current 
assignment a is satisfying, then clearly there is nothing to do and the algorithm terminates. Otherwise, the 
algorithm picks an index i such that clause $i is unsatisfied uniformly at random among all such indices. 
Clause $i is a disjunction of fc literals $ii V • • • V ^ik, and Walksat picks an index j G {1, . . . , fc} 
uniformly at random and flips the value assigned to the variable | underlying the literal Of course, 
this ensures that under the new assignment clause $i is satisfied, but flipping $y may create new unsat- 
isfied clauses. If after a certain number of iterations no satisfying assignment is found, Walksat gives 
up and concedes failure. The pseudocode is shown in Figure [T] In the worst case, it can be shown that 
Walksat(<I', (2 — 2/fc)'^^+°'^-'^") will find a satisfying assignment of a satisfiable input formula $ on n 
variables with probability 1 — o(l) 

Although Walksat is conceptually very simple, analyzing this algorithm on random formulas is a 
challenge. Indeed, Walksat does not follow the naive template of the previously analysed algorithms that 
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Algorithm 1.1 Walksat($, Tmax) 

Input: A fc-CNF $ = (f>i A • • • A over the variables xi,. . . ,Xn and a number Tmax > 0. 
Output: An assignment a : V ^ {0, 1}. 

0. Initially, let a{xi) = 1 for i = 1, . . . , rt. 

1 . Repeat the following Tmax times (with independent random choices) 

2. If (T is a satisfying assignment, then halt and output a. 

3. Otherwise, choose an index i such that clause -J?, is unsatisfied under a uniformly at random. 

4. Suppose that <I>i = <I>ii V ■ ■ • V $ife- 

Choose an index j e {1, . . . , fc} uniformly at random. 
Flip the value of a{\^ij\). 

5. Return 'failure'. 

Figure 1: The Walksat algorithm. 

assign one variable at a time for good, because its random choices may (and will) lead Walksat to flipping 
quite a few variables several times over This causes stochastic dependencies that seem to render the 
differential equation method, the mainstay of the previous analyses of random fc-SAT algorithms, useless. 
The goal of the present paper is to present an analysis of Walksat via a different approach that allows us 
to deal with the stochastic dependencies. Our main result is as follows. 

Theorem 1.2 There is a constant fco > 3 such that for and any 

0<r<^.2Vfc, 
Walksat(^', n) outputs a satisfying assignment w.h.p. 

1.0.1 Related work. 

To put Theorem ll.2l in perspective, let us compare it with other results on random fc-SAT algorithms. The 
simplest conceivable one is presumably UnitClause. Considering all variables unassigned initially, 
UnitClause sets one variable at a time as follows. If there is a clause in which fc — 1 variables have been 
assigned akeady without satisfying that clause (a 'unit clause'), the algorithm has to assign the fcth variable 
so as to satisfy the unit clause. If there is no unit clause, a currently unassigned variable is chosen randomly 
and is assigned a random truth value. As UnitClause is extremely simple and does not backtrack, it can 
be analyzed via the method of differential equations f[]. The result is that UnitClause finds a satisfying 
assignment with a non-vanishing probability so long as m/n < (1 — Ofc(l))| • 2'^/fc, where 0^(1) hides 
a term that tends to as fc gets large ||6l. Furthermore, ShortestClause, a natural generalization of 
UnitClause, succeeds for m/n < (1 — Ofc(l))c^/8-2*^/fc with /i/g/i probability [7:1. Indeed, the algorithm 
can be modified so as to succeed with high probability even for m/n < (1.817 - Ofc(l)) • 2'^/fc by allowing 
a very limited amount of backtracking IITOl . Finally, the algorithm Fix from ||8l, which was specifically 
designed for solving random fc-SAT instances, succeeds up to m/n < (1 — Ok{l))2'^ ln(fc)/fc. By com- 
parison, non-constructive arguments show that the threshold for the existence of a satisfying assignment is 
(l + Ofe(l))-2'=ln2||2l[l. 

In summary, Theorem I 1 . 2 I shows that Walksat is broadly competitive with the other known algorithms 
for random fc-SAT. That said, the main point of this paper is not to produce a better algorithmic bound for 
random fc-SAT, but to address the methodological challenge of analyzing algorithms such as Walksat 
that may reassign variables. This difficult aspect did not occur or was sidestepped in the aforementioned 
previous analyses |[T]|7][8][l0l. Indeed, the lack of techniques for such analyses is arguably one of the most 
important shortcomings of the current theory of random discrete structures. 

Theorem 11.21 improves substantially on the previous analyses of Walksat, at least for general fc. 
The best previous result for this case showed that w.h.p. Walksat will find a satisfying assignment with 
Tmax = n if m/n < p' ■ 2''/fc^, for a certain constant p' > |[9|- The proof of this result is based on a 



rather simple observation that allows to sidestep the analysis of the stochastic dependencies that arise in the 
execution of Walk sat. However, it is not difficult to see that this argument is confined to clause/variable 
densities ra/n < Theorem 1 1.2l improves this result by a factor of 9(fc). 

Furthermore, the techniques of Alekhnovich and Ben-Sasson Q show that for any k Walks at will 
w.h.p. find a satisfying assignment within 0{7i) iterations if m/n < rk-pure, where rk-pure is the 'pure 
literal threshold' . The analysis in [j4] depends heavily on the fact that the combinatorial structure of the 
hypergraph underlying the random fc-CNF $ is extremely simple for rn/n < rk-pure- Furthermore, 
because rk-pure in the limit of large k ITSl . this result is quite weak for general k. Yet H remains the 
best known result for 'small' k. For instance, in the case k = 3 the pure literal bound is r^-pure ~ 1-63 15]. 

Monasson and Semerjian ||T9l applied non-rigorous techniques from statistical mechanics to study the 
Walksat algorithm on random formulas. Their work suggests that Walksat ($, 0(?i)) will find a satis- 
fying assignment w.h.p. if m/n < (1 — Ofc(l))2'^/fc. Theorem [O] confirms this claim, up to the constant 
factor 1/25. 

In contrast to the previous 'indirect' attempts at analyzing Walksat on random formulas |]4l|9], in the 
present paper we develop a technique for tracing the execution of the algorithm directly. This allows us to 
keep track of the arising stochastic dependencies explicitly. Before we outline our analysis, we need some 
notation and preliminaries. 

2 Preliminaries 

We let Q,k{'n^m) be the set of all fc-SAT formulas with variables from V = {xi, . . . ,x„} that contain 
precisely m clauses. To be precise, we consider each formula an ordered m-tuple of clauses and each 
clause an ordered fc-tuple of literals, allowing both literals to occur repeatedly in one clause and clauses to 
occur repeatedly in the formula. Thus, \nk{n,m)\ = {2n)''™. LetS]fc(7i,m) be the power set of rjfe(n, m), 
and let P — Pfc(n, m) be the uniform probability measure. 

As indicated above, we denote a uniformly random element of flk(n, m) by In addition, we use the 
symbol $ to denote specific (i.e., non-random) elements of flk{n, m). If <i> S ilfe(n, m), then $i denotes 
the 2th clause of and $y denotes the jth literal of If Z C [m] is a set of indices, then we let 

Recall that a filtration is a sequence {Tt)o<t<T of cr-algebras J^t C I]/c(n,TO) such that C Tt+i 
for all < < < r. For a random variable X : flkin, m) — > R we let E [^|J^t] denote the conditional 
expectation. Thus, E[X|J"t] : Q,k{n,m) R is a J"t -measurable random variable such that for any 
A € J^f we have 

Y^m\H (<i>)-E^(*)- 

Also remember that P [■\Ft\ assigns a probabiUty measure P [■\J-t\ to any $ S ^k{n, m), namely 

P [-iTt] m-.Ae Sfe(n, m) ^ E [1^1 J-*] ($), 
where 1a is the indicator of the event A. 

Lemma 2.1 Let {J-t)a<t<T be a filtration and let {Xt)i<t<T be a sequence of non-negative random vari- 
ables such that each Xt is Ft-fneasurable. Assume that there are numbers > such that E < 
£,tforall l<t<T. r/zenE[ni<t<,^t|-^o] < Y{i<t<rit. 

Proof. For 1 < s < r we let Ys = Y[t=i -^t- Let s > 1. Since is J^s_i -measurable, we obtain 

E [Ys\To] = E J-q] = E [E [y,_iX,|^,,_i] \To] = E [y,_iE [X,\Ts-i] \J^o] < [^-il-Fo] , 

whence the assertion follows by induction. □ 
We also need the following tail bound ("Azuma-Hoeffding", e.g. lfT2l p. 37]). 

Lemma 2.2 Let {Mt)o<t<T be a martingale with respect to a filtration {Tt)o<t<T such that Mq = 0. 
Suppose that there exist numbers Ct such that \Mt — Mt-i \ < Ctfor all I < t < t. Then for any A > we 
have P [\Mr \ > A] < exp [-Xy{2J2l^^ c?)] . 



A fc-CNF 4> = $1 A • • • A $„i gives rise to a bipartite graph whose vertices are the variables V and the 
clauses {$i : i £ [m]}, and in which each clause is adjacent to all the variables that occur in it. This is the 
factor graph of $. For a vertex v of the factor graph we denote by N{v) = N,j,{v) the neighborhood of v 
in the factor graph. For a set Z C [m] we let N{^z) ~ Ujgz ^i^i) the set of all variables that occur 
in the sub-formula ^z- 

Let A, B be two disjoint sets of vertices of the factor graph. Recall that a l-fold matching from A to B 
is a set AI of A-B-edges such that each a G ^ is incident with precisely I edges from M, while each b E B 
is incident with at most one edge from AI. We will make use of the following simple expansion property 
of the factor graph of random formulas. 

Lemma 2.3 There is a constant ko > such that for all k > ko and for m/n < 2*^1112 the random 
formula $ has the following property w.h.p. 



For any set Z C [m] of size \Z\ < n/k"^ there is a 0.9k-fold matching from to iV($ 
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Proof. We start by proving that w.h.p. the random formula $ has the following property. 

For any set U of < n/k variables we have \{i e [to] : 7V(*i) C ?7}| < l.l\U\/k. (2) 

To prove (|2]i we use a 'first moment' argument. ForsetJJ C V welet Xjj = lif |{i G [m] : iV($i) C U}\ > 
l.l\U\/k, and we set Xjj = otherwise. Then 



E [Xu] ^P[Xu^l]< 
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large, we obtain 
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Summing the last expression over I < u < n/k and assuming that k > ko is large enough, we see that 
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Thus, J2i<u<n/k ~ *-* W.h.p. by Markov's inequality. Hence, (|2]i holds true w.h.p. 

Now, assume that $ satisfies Let Z C [to] be a set of size |Z| < n/fc^. Let Y d Z and 
let U = iV(*y). Then \U\ < n/k, and 7V(*i) C U for any i e Y. Therefore, © implies that 
|r| < l.l|[/|/fc,i.e., \U\ > > 0.9fc|y|. Hence, the assertion follows from the marriage theorem. □ 

The following lemma states a second expansion-type property. 

Lemma 2.4 There exists a constant ko > such that for all k > ko and for any a > 0, A > 4 satisfying 
e < k~^ and < -{2c)~^^' the random formula $ with m/n < 2^ In 2 has the following property w.h.p. 



Let Z C [m] he any set of size \Z\ < en. Ifii, 
pairwise distinct indices such that 



e [m] \ Z is a sequence of 



|iV(*,J n Ni^zu{^r.l<3<s})\ > A foralll<s< I, 



(3) 



then I < en. 



Proof. It is clearly sufficient to prove that the desired property holds w.h.p. for all sets Z of size precisely 
\Z\ = en. Assume that there is a set Z and a sequence i = (zi, . . . , i;) of pairwise distinct indices in 
[m] \ Z of length I = en such that |A^(*iJ n N{'^x\j{iy.i<]<s})\ > A forall 1 < s < I. Then the sets 
Y = Uj=i ^{^ij ) \ N{^z) C V and Z have the following properties. 

a. |r| < e(fc- A)n. 

b. There is a set / C [m] \ Z of size |/| = en such that 7V(*i) C iV(*z) U F for all i e /. 

Property a. holds because each clause adds no more than k — X 'new' variables to Y, and b. is true for 
the set I ^ {ij : I < j < I}. 

To prove that w.h.p. there do not exist Z and i of length Z = en as above, we are going to show by a 
first moment argument that w.h.p. the random formula $ does not feature sets Y, Z that satisfy a. and b. 
More precisely, for sets Z C [m] of size \Z\ = en, Y C V of size \Y\ = e{k — X)n, and / C [m] \ Z of 
size |/| = en we let £{Z, Y, I) be the event that N{^^) C N{^z) U Y for all i e /. Then for any fixed 
Z, Y, I we have 
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because each of the k\I\ variable occurrences in the clauses is uniformly distributed over V. Hence, by 
the union bound, for large enough k 
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where the last inequality follows from our assumption that e < k"^ with k > fcp sufficiently large. Due 
to our assumption that e^ < ic(2e)~'"', (Hji yields P [3Z, Y, I : £{Z, Y, I)] < cxp(-£n) o(l), whence 
the assertion follows. □ 



Finally, it will be convenient to assume in our proof of Theorem ll.2l that the formula density r ~ m/n 
is 'not too small' and that the clause length k is sufficiently large. These assumptions are justified as the 
case of small k or very small r is already covered by ||9l- 

Theorem 2.5 (im) There is a constant fco > 3 such that for all k > ko and all r < ^ • 2*^/fc^ w.h.p. 
Walksat(#, n) will find a satisfying assignment. 



3 Outline of the analysis 

Throughout this section we assume that k > ko for some large enough constant ko > 0, and that r = 
m/n ^ p ■ 2^ /k with k~^ ^ P < Po = 1/25. We can make these assumptions as otherwise the assertion 
ofTheorem \1.2\ already follows from Theorem \2.5\ Furthermore, let 

A = Vfc and e = exp(-fc^/^). (5) 

The standard approach to analyzing an algorithm on random fc-SAT formulas is the method of deferred 
decisions, which often reduces the analysis to the study of a system of ordinary differential equations that 



capture the dynamics of the algorithm [[T]. Roughly speaking, the method of deferred decisions applies 
where the state of the algorithm after a given number of steps can be described by a simple probability 
distribution, depending only on a very few parameters determined by the past decisions of the algorithm. 
This is typically so in the case of simple backtrack- free algorithms such as Unit Clause. 

However, in the case of Walksat, this approach does not apply because the algorithm is bound to flip 
many variables more than once. This entails that the algorithms' future steps depend on past events in a 
more complicated way than the method of deferred decisions can accommodate. Hence, our approach will 
be to use the method of deferred decisions to trace the effect of flipping a variable /or the first time. But we 
will need additional arguments to deal with the dependencies that arise out of flipping the same variable 
several times. 

To get started, let us investigate the effect of first flip that Walksat performs. Let cr = 1 be 
the assignment that sets every variable to true. Clearly, a clause is unsatisfied under a iff it consists 
of negative literals only. As $ consists of m uniformly random and independent clauses, the number of 
unsatisfied clauses has a binomial distribution Bin(m, 2^^), and thus there will be (l+o(l))2~*''m ^ pn/k 
all-negative clauses w.h.p. To perform its first flip, Walksat chooses an index i e [m] such that is 
all-negative uniformly at random, then chooses a literal index j e [k] uniformly, and sets cr(|$ij |) to false, 
thereby satisfying clause 

But, of course, flipping | may well generate new unsatisfied clauses. We need to study their number. 
As #i is just a uniformly random all-negative clause, the random variable is uniformly distributed 
over the set of all n variables, and thus we may assume without loss that | = xi. Furthermore, if a 
clause becomes unsatisfied because variable xi got flipped, then xi must have been the only variable 
that appears positively in Now, the number of clauses whose only positive literal is xi has distribution 
Bin(TO, k/ (712'^) + 0(1/71^)). Indeed, the probability that a random clause has precisely one positive literal 
is kl2^, and the probability that this positive literal happens to be xi is 1 /n; the 0{l/n^) accounts for the 
number of clauses in which variable xi occurs more than once. Hence, the expected number of newly 
created unsatisfied clauses equals (1 + o(l))^^ ~ P- 

In summary, as we are assuming that p < po = 1/25 < 1, the expected change in the number of 
unsatisfied clauses as a result of the first flip is bounded from above by 

p - 1 + 0(1) < 0. 

(The precise value is even smaller because xi may occur in further all-negative clauses.) Thus, we expect 
that the first flip will indeed reduce the number of unsatisfied clauses. Of course, this simple calculation 
does not extend to the further steps of Walksat because knowing the outcome of the first flip renders the 
various above statements about clauses/literals being uniformly distributed invalid. 

To analyze the further flips, we will describe Walksat as a stochastic process. Our time parameter 
will be the numberof iterations of the main loop (Steps 2-A in Figure[TJ, i.e., the number of flips performed. 
To represent the conditioning of the random input formula imposed up to time t, we will define a sequence 
of random maps (7r()t>o. These maps reflect for each pair (i,j) G [m] x [k] the conditional distribution of 
the literals €>y , given the information that Walksat has revealed after performing the first t flips. More 
precisely, the value of 7rt{i,j) will either be just the sign of the literal ^ij, or the actual literal itself. 
In the initial map ttq, we have Tro{i,j) = sign($y ) for all £ [m] x [k]. 

At times t > 1 the map irt will feature the occurrences of all variables that have been flipped thus far. 
That is, for any pair such that Walksat has flipped the variable at least once by time t, we 
let TTt{i,j) = ^ij. This information will be necessary for us to investigate the effect of flipping the same 
variable more than once. 

In addition, we need to pay particular attention to clauses that contain many variables that have been 
flipped at least once. The reason is that these clauses have 'too little randomness' left for a direct analysis, 
and thus we will need to study them separately. More precisely, in our map nt we will fully reveal all clauses 

in which at least ki = 0.49A: literals have been flipped at least once. Furthermore, we will also 
recursively reveal all clauses that contain at least A variables from clauses that were fully revealed before. 
This recursive process ensures that we can separate the analysis of clauses that are 'heavily conditioned' 
by the past steps of Walksat from the bulk of the formula. 

Throughout this process that mirrors the execution of Walksat, all variables whose occurrences have 
been revealed will be labeled either with an asterisk or with a zero. Those variables that got revealed 



PIO. If the assignment <Tt_i satisfies then the process terminates. 

PIl. Otherwise, choose an index it such that is unsatisfied under crt_i uniformly at random from the 
set of all such indices. In addition, choose jt G [k] uniformly at random. Define at : V —?' {0, 1} by 
letting crt(|*i,jj) = 1 - (Tt_i(|*,,jJ) and atix) = crt_i(a;) for all a; ^ l*»tjtl- 

PI2. Initially, let Zt ^ Zt-i andW* = M-i- 

While there is an index i G [m] \ Zt such that is {At-i U A/j U {l^i^jt |})-negative and either 

• there are at least fci indices j e [fc] with | | E At-i U or 

• there are more than A indices j G [k] with G A/j, 

add the least such index z,nin to Zt and add the variables | : J <= [k]} to Aft- 

PI3. Let At = {At-i U {|*HjJ}) \ 

Define the map tt^ : [m] x [k] { — 1, 1} U L by letting 

r if|*,,|€AuM, 

*^ ' 1^ sign($,y) otherwise. 

Figure 2: the construction of the maps irt 

because they occur either in a 'heavily conditioned' clause or in another clause that got revealed by the 
recursive process described in the previous paragraph will be labeled 0. All other variables that have been 
flipped by Walksat at least once are labeled *. We will let At denote the set of all variables labeled *, 
and A/j the set of all variables labeled 0. 

Let us now define the maps nt and the sets At , Mt formally. Each tt^ is a map [jti] x [fc] — > {— 1 , 1} U L, 
with L = {xi, ii, . . . , Xn, x„} the set of literals. As mentioned above, we let7ro(i,j) = sign($ij) for all 
e [m] X [fc]. Additionally, let Aq = Na = Zq = 0, and let o-q : V ^ , x ^ 1 be the all-true 

assignment. For a set 5 C we call a clause $i S-negative if for all j £ [fc] with sign($ij) = 1 we have 
^ij e S. For t > 1, we define the maps nt inductively via the process shown in Figure |2] Intuitively, the 
set Zt contains the clauses that are 'heavily conditioned' at time t, and At is the set of variables that occur 
in such clauses. Moreover, At is the set of all variables that have been flipped at least once by time t except 
the ones that belong to A/j . 

Let T be the stopping time of this process, i.e., the minimum t such that at satisfies $ (or oo if there is 
no such t). For t > T,we define tt^ = ttt, at = ar. At = At, Aft = Nt, and Zt = Zt- 

Steps PIO-PIl mirror the main loop of the Walksat algorithm; in particular, the stopping time T 
equals the total number of iterations of the main loop of Walksat before a satisfying assignment is found. 
The purpose of the remaining steps is to 'update' the sets At and Zt and the map tt^ as described above. 
Before we continue, it may be useful to illustrate the construction of the maps TTt with an example. 

Example 3.1 Let us go through the example of a 5-S AT formula with 6 clauses on 10 variables. For the 
sake of this example, we will work with ki = 2 and A = 2. (Recall that in our proof we actually assume 
that k > fc'o is large enough, ki = 0.49fc and A ~ \fk.) We will represent the maps ttj by tables whose 
columns correspond to the clauses Thus, the jth entry in column i represents the value TTt{i,j)- To 
improve readability, we just write + and — instead of ±1. Suppose that the initial map ttq, containing the 
signs of all literals, reads 

--- + + + 

- + - + - + 

TTO = -- -- - + 

---- + + 
---- + + 

The initial assignment ao is the all-true assignment, and Aq = Mo = Zq = (/}. Throughout, we will mark 
the variables in At by an asterisk * and the variables in Aft by a 0. 

Being all-negative, clauses #i and $3 are unsatisfied under ctq. Therefore, at time t = \ step PIl 
chooses ii € {IjS} randomly; say, the outcome is ii — I. In addition, PIl chooses ji € [fc] = 



{1,2,3,4,5} uniformly at random. Suppose the result is ji ~ 5. To carry on, we need to reveal the 
variable Thus far, the process has not imposed any conditioning on |$i5|, and therefore this vari- 

able is uniformly distributed over the set of all our n = 10 variables. Assume that indeed l^isl = Xi. 
Then PIl sets <7i{xi) = and cri(.T) = 1/or all x ^ xi. 

To implement PIl we need to reveal all occurrences of xi in our random formula. As there is no 
previous conditioning on any of variables \^ij\ with ^ (1, 5), these variables remain independently 
uniformly distributed over the set of all variables, and thus the events \ — x\ \ occur independently 

with probability 1/n. Suppose that xi occurs at the following positions: 

- - - Xl + + 

- Xl - + - + 

- - - - Xl + 

---- + + 

Xl - - - + Xl 

Then there is no clause with at least ki occurrences of a variable from Ao U Ap U {xi} = {xi}, and thus 
step PI2 is void. Hence, at the end of the first iteration we have Ai = {xi}, Afi = Zi = 0, and 



TTl = - - - - X^ + 
---- + + 
X^ ^ X-y^ 

At time t — 2 there are two unsatisfied clauses: $2, whose only positive literal got flipped to false, and 
#3, which was unsatisfied initially. Step PIl chooses one of them randomly, say 12 = 2, and also chooses 
a random position j2 € [k], say j2 = 2. As we already know from the first step, the literal in this position 
is $22 = ■'''1(2, 2) = Xl. In effect, the second iteration reverses the flip made in the first one and thus 02 
is the all-true assignment. Since we have revealed all the occurrences of Xi already, step PI2 is void and 

TT2 = TTl, A2 = {Xl}, andN'2 = Z2 

At the start of the third iteration the unsatisfied clauses are $1, $3. Suppose PIl chooses = 1 and 
j3 = 1. Then we need to reveal the variable |$ii|. At this point, the only conditioning imposed on this 
variable is that it is different from xi, because all occurrences of xi have been revealed already. Thus, 
|$ii| is uniformly distributed over X2, . . . , Xiq. Suppose that = X2. Then a3{x2) = anda3{x) = 1 

for all X 7^ X2. To reveal the occurrences of X2 all over the formula, note that by the same argument we 
applied to |$ii | all spots marked ± in 1^2 hide variables that are uniformly distributed over X2, . . . , Xio- 
Let us assume that X2 occurs in the following positions. 

X2 — - 2^1 + + 

- - + - + 

- - X2 - ^1 + 

- - - - + X2 

x\ - - - ^ X\ 

As clause $1 is A2 U M2 U \x2\ = {.ti, X2\-negative and contains ki = 2 occurrences of variables 
from A2 U {X2} = {.Ti , X2}, PI2 sets Z3 = {1}, reveals the remaining three variables in $1, and adds all 
variables that occur in $1 to Hz- Suppose that the remaining variables in $1 are |#i2| = X3, |#i3| = x^, 
l^isl = 2:5. Then Ms ~ {xi, .T2, X3, 0:4, xs}; in particular, xi,X2 are now labeled Q. The new label 
'overwrites' the * because PIS ensures that A3, = {A2 U {a;2}) \ As = 0. In order to carry out PI2, we 
need to reveal all occurrences of variables from As. Suppose this yields 

X2 X-y ^ ^ 

a;? - xo - + 
x° - - - X? + 

yO _ _ ^0 



Then clause $4 has become A2 U A/3 U {x2 } = {xi , . . . , 3:5 ^-negative ( as there is no +-sign left in column 
four), and thus PI2 sets Z3 = {1, 4}. To proceed, we need to reveal the remaining — sign of add the 
underlying variable to As, and reveal all of its occurrences. Suppose that this yields 



X2 Xi 

Xg ^4 — Xg 

^0 _ _ ^0 

X4 -X.Q 

^0 _ _ 

Xc Xa 



At this point PI2 stops, because clauses $5, $g have +-signs left and clauses $2, ^3 contain only one 
variable labeled 0. Thus, at the end of the third iteration we have A3 = 0, M3 ~ {xi , . . . , xg }, ^3 = {1,4}, 
and 

x^ - - x? + + 

Xg X4 — X5 — -f- 

TTs = xl - - Xq X° Xg 
;^;0 _ _ =0 ^0 ^0 

;j^0 =0 =0 ^0 ^0 

X4 Xc^ Xg X4 X^ 

As the fourth iteration commences, the only unsatisfied clause left is $3, whence 14 = 3. Moreover, 
assume that J4 = 1. As we have revealed all occurrences o/xi, . . . , xg, at this point we know that |$3i| 
is uniformly distributed over {xTjXsjXgjXio}. Suppose that indeed \^3i\ = xy. Thus, Pll sets a4{x2) = 
(T4(x7) = and (7i{x) ~ I for all x 7^ X2, X7. Suppose that revealing all occurrences ofxi yields 



x° 




X7 


Xj 


+ 


X7 


^3 


X4 




^0 
X5 


X7 


+ 


X4 






;p;0 
2^6 


X^ 




™0 
^5 






X4 


2^3 


•Xj'2 


™0 
Xi 




2^5 


2^3 


X4 


X4 



TTzen there are no Az U A/3 U {x^^-negative clauses with i ^ Z-^ that have at least two occurrences of 
a variable from Az U {xt}. Therefore, PI2 is void, and at the end of the fourth iteration we have 



;t;0 ^0 ^0 ^* _[_ 

3 1 5 7 ^ 

/TT" , /yiO 

774 — X4 X7 Xg X4 Xg , 



Ai = {2:7}, Mi = {xi, . . . , Xg}, and 2,^ ~ {1, 4}. As (T4 /i satisfying the process stops and T — 4. □ 



To trace the process PI0-PI3 over time we define a filtration {J-t)t>o by letting Tt be the cr-algebra 
generated by the random variables is,js and tt^ (i , j ) with s < t and (i , j ) G [m] x [fc] . Then intuitively, a 
random variable X is J"t -measurable if its value is determined by the first t steps of the process PI0-PI3. 
In particular, we have the following. 

Fact 3.2 For any t > 1, any x Cz V, and any i G [to] the events {at (x) — 1}, {^i is satisfied under at}, 
{x € At}, {i € Zt}, {x S A/f }, and {T = t} are J-t-measurable. 

Proof. The construction in steps PI2 and PI3 ensures that for any t > 1 we have ^i^jj G At U Aft and 
thus TTt{it,jt) = ^itjt This implies that for any variable x £ V the event {at{x) — 1} is J^t-measurable. 
In fact, we have at{x) = 1 iff the number |{1 < s < i : \TTt (is, Js) | = x}\ of times x has been flipped is 
even (because ao is the all- true assignment). 

This implies that for any i G [m] the event is satisfied under at } is J^t-measurable. In fact, if there 
is an index j E [k] such that nt{i,j) = 1, then #y is a positive literal whose underlying variable has not 
been flipped before, whence at satisfies Moreover, if there is an index j G [k] such that $y 7^ ±1, 



then by the previous paragraph the event that the literal $y = TTt{i,j) is true under at is J't -measurable. 
If there is such a satisfied literal ^ij, then $i is satisfied. Conversely, if there is no j e [k] such that either 
'^t{i,j) = 1 or Trt{i,j) is a literal that is satisfied under at, then clause $i is unsatisfied. Hence, the event 
{at is satisfying} is J^t -measurable as well, and therefore so is the event {T — t}. 

Furthermore, observe that i E Zt iff for all j e [k] we have TTt{i,j) ^ !}■ For if i G Zt, then 
for all j G [k] we have G A/j and thus iTt(i,j) = ^ ±1 due to PI3. Conversely, assuming that 
k > ko is large enough, any i G [k] such that TTt{i,j) <^ { — 1, 1} for all j G [k] must satisfy one of the 
two conditions that lead PI2 to add i to Zt- Hence, for any i G [m] the event {i G Zt} is J^t-measurable. 
As by construction A/f = {TTt{i,j) : i G Zt,j G [fc]}, we conclude that for any variable x E V the event 
{x G A/t} is J^t-measurable. 

Finally, the construction in PIS ensures that At = {{T^tiis, js)\ ■ ^ ^ s < t} \ Aft- As for any x the 
events {x G {\TTt{is, js)\ : 1 < s < t}} and {.t G A/j} are -measurable, so is the event {x G At}- □ 

If TTt{i,j) = ±1, then up to time t the process PI0-PI3 has only taken the sign of the literal $y into 
account, but has been oblivious to the underlying variable. The only conditioning is that | ^ AtU Aft 
(because otherwise PI3 would have replaced the ±1 by the actual literal). Since the input formula ^> is 
random, this implies that \ is uniformly distributed over V \ {At U A/f). In fact, for all such 
that TTt{i,j) = ±1 the underlying variables are independently uniformly distributed over V \ {At U A/j). 
Formally, we can state this key observation as follows. 

Fact 3.3 Let t > 0. Let £t be the set of all pairs {i, j) such that TTt{i, j) € { — 1, !}• The conditional joint 
distribution of the variables {\^ij\){i.j)££t given Tt is uniform over iy \ {At U Mt)f*. That is, for any 
map f : Et ^ V \ {At U Aft) we have 

P [V(z,j) G £t ■- = f{^,m = |F\ (A U A^oi-i^'i. 

Let 

T*^en with0=— . 

3fc 

Our overall goal is to prove that the stopping time of the process PI0-PI3 satisfies T < T* w.h.p. To 
prove this, we will define non-negative random variables St , Ht such that St + Ht = implies that at is a 
satisfying assignment. We will then trace St,Ht for 1 < t <T*- 
For any i > 1 let 

Vt = {i e [m] : *j is At U Wt-negative} . 

As PI3 ensures that is At U A/t-negative iff 7Tt{i,j) ^ 1 for all j G [k], the event {i G T>t} is Tt- 
measurable for any i G [771] . We define 

5*0 = |I?o| and St = |A| - |A| for t > 1. (6) 

Any clause with i ^ 2?t is satisfied under at- For if j G [k] is such that iTt{i,j) = 1, then is a 
positive literal and at{^ij) = 1, because Walksat starts with the all-true assignment ctq and the variable 
has not been flipped up to time t- Clearly, in order to study the random variable St it is crucial to 
estimate \T>t\- This is the purpose of the following proposition, whose proof we defer to Section|4] 

Proposition 3.4 W.h.p. we have \Vt \ < 2^~''mfor all t < T*. 

To define the random variables Ht, let us call an assignment t : Aft ^ {0, 1} rich for Zt if in each 
clause $i with i G at least 0.8fc literals are satisfied under r. 

Proposition 3.5 W.h.p. there is a sequence {Tt)i<t<T* with the following properties. 
L For any 1 <t < T*, Tt is a rich assignment for Zt. 
2. For any I < t < T* and any x G Aft-i we have Tt{x) = Tt-i{x). 



Assuming that there is a sequence {Tt)i<t<T* as in Proposition [33] we define Hq = and 

Ht = \{x e Aft : atix) ^ nix)}] for 1 < f < T*, 

and Ht ~ \J\ft \ for t > T*. For the sake of completeness, we also let Ht = \J\ft \ if there is no such sequence 
{Tt)i<t<T* ■ The proof of Proposition 13 .5 I hinges upon the following fact. 

Proposition 3.6 W.h.p. we have \Zt \ < en for all t < T*. 

We defer the proof of Proposition l3.6l to Section|5] Assuming Proposition l3.6l we can derive Proposition l3.5l 
rather easily. 

Proof of Proposition 15.51 (assumins Proposition 13.61 ). By Lemma l273l we may assume that $ has the 
expansion property ([T]i. Furthermore, by Proposition 13.61 we may assume that \Zt\ < en for all t < T*. 
Under these assumptions we will construct the sequence (r^ )i<t<T* by induction on t > 1. Thus, suppose 
that 1 < t <T* and that we have already got assignments with 1 < s < t that satisfy 1.-2. 

The set Z =- Zt \ Zt-\ of indices that Zt gained at time t has size \Z\ < \Zt\ < en. Therefore, ([T) 
ensures that there is a 0.9fc-fold matching M from Z to the set 

N^N{^z)^m.j\:{t.j)eZx[k]} 

of variables that occur in the clauses $i with i G Z. The construction in PI2 ensures that none of these 
clauses 4>i has more than A occurrences of a variable from Aft-i (as otherwise i £ Zt^i). Therefore, in 
the matching M' obtained from M by omitting all edges e = {i, x} with i £ Z and x e Aft-i each clause 
with i G Z is incident with at least 0.9fc — A > 0.8fc edges. Now, for each edge e = {i, x} £ M' let 
Tt{x) be the truth value that makes the corresponding literal in $i evaluate to true. Furthermore, for all 
y <S Mt-i let Tt{y) = Tt-i{y), and for all other variables x' e Mt let Tt{x') = 1. This ensures that Tt 
satisfies the two conditions in Proposition 13. 5 1 □ 

Having defined the random variables St,Ht, we are now going to verify that they suit their intended 
purpose, i.e., that St + Ht = implies that at is satisfying. 

Proposition 3.7 Let I < t < T*. If St + Ht = 0, then at is a satisfying assignment. 

Proof. Let Ut be the number of clause indices i G [m] \ £ Zt such that $i is unsatisfied under at. We 
claim that 

Ut < St ^ \Vt\ - \At\. (7) 

To see this, recall that any index i £ [m] such that 4>i is unsatisfied under at belongs to Vt. Therefore, to 
prove (|7]i it suffices to construct injective maps st : At —?' T^t such that for any x £ At the clause ^st{x) is 
satisfied under at. In fact, the map st will have the property that for each x £ At there is an index j £ [k] 
such that X ~ l^'^^j^.j^ | and such that the literal is true under at. 

The construction of the maps st is inductive. For t = we have Aq ^ 9 and thus there is nothing to 
do. Thus, suppose that 1 < t < T and that we have defined st-i already. Let y = l^i^j^ \ be the variable 
flipped at time t. If it ^ Zt, then y £ At and we define St{y) = it- Moreover, we let st{x) = st-^i{x) for 
all X £ At\ {y} C At-i. (Note that it is possible that y £ At-i as y may have been flipped before.) For 
t > T we set st = st-i. 

To verify that st has the desired properties, assume that T > t and observe that PIl ensures that 
was unsatisfied under at-i. Thus, it £ Pt-i C Vt. But as PIl sets at{y) = 1 — at^i{y), is satisfied 
under at. Furthermore, for all x £ At \ {y} we have at{x) = o't_i(x), and thus each of these variables 
contributes a true literal to its clause '^st{x) = ^'st_i(a;) by induction. Since st-i is injective but was 
unsatisfied under at-i, we have it ^ Im(st_i), whence st is injective. This establishes (|7]i. 

As (|7]) shows, 5*4 = impUes Ut = 0, i.e., at satisfies all clauses $j with i ^ Zt. To complete the 
proof, we need to show that if Ht = 0, then at also satisfies all clauses 4*; with i £ Zt. But if Ht = 0, 
then at{x) = Tt{x) for all x £ Aft, and Tt is a satisfying assignment of ^ 

Finally, we have all the pieces in place to prove Theorem II. 2 1 



Proof of Theorem U .2\ (assumins Propositions \3.4\ and \3M . Proposition l3.7l shows that 



P [T > T*] = P [T > T* A VI < i < T* : S't + i7t > 0] . 

We are going to bound the probability on the rh.s. To this end, we work with two random variables S[, H[ 
that are easier to analyze than the original St,Ht. Namely, we let S'q = Hq = 0, and 

^ ^ ^ [0 otherwise 

In other words, we let S'^ = Sf_i — 1 if the variable flipped at time t had not been flipped before and does 
not occur in any of the 'exceptional' clauses $2t_i- Otherwise, S'^ = <5'^_i. 
We claim that 

St < \Vt\+k \Zt\ + S't for any t > 0. (8) 

To see this, recall from (|6]l that St = {Vtl — \At\- By PI3, the set At contains all variables l^i^jj such 
that 7rs_i(is, js) = —1 with s < t, except the ones that belong to A/j. Since \Aft\ < fc we obtain (l8]l. 
Furthermore, we let Hq — and 

r -1 if e M-1 anda*(|*.,,J) = n(|*..,J), 

H^^H^_, + I 1 if|*,,,JeAA,_iandat(|*,,,J)^n(|*,uJ), > !)• 

[ otherwise 

Thus, starting at 0, we decrease the value of by one if the variable flipped at time t lies in A/j-i and its 
new value coincides with the 'ideal' assignment tj, while we increase by one if these values differ 
We claim that 

Ht < k\Zt\+ H[ for any t > 0. (9) 

For Hq = Hq and 

Ht-Ht-i = \{x Nt ■■ <yt{x) ^ Tt{x))\ - \{x Nt-i ■■ <yt-ii:x) n-i{x))\ 

< \Nt\Mt-i\+H[-H[_^<k\Zt\Zt-i\+H[-H[^^ foranyt>l. 

Combining ([8]) and ^ with Propositions 13 .41 and 13.61 we see that w.h.p. 

St+Ht < \Vt\ + 2k\Zt\ + S't + H't 

< + 2k\Zt\+S^ + H'^<^ + en + S^ + H'^ for any t<T*. (10) 

Hence, we are left to analyze S't + H[. 

Basically, 5^ + H[ is a random walk with a negative drift. More precisely, we claim that 

E [S't + H[\Ft-i\ < S't'_i + - i for ah t < min {T, T*} . (1 1) 

There are two cases to consider 

Case 1: it ^ -Zt-i- The construction in step PI2 ensures that there are fewer than A indices j such that 
\^itj\ G A/j-i. Furthermore, PI2 ensures that there are less than fci indices j such that G 
At~i- Moreover, there is no index j such that TTt-i{it,j) = 1, because otherwise clause would 
have been satisfied under (Tf_i. This means that for at least k — ki — X indices j e [k] we have 
Trt-i{it,j) = — 1. Therefore, as jt G [k] is chosen uniformly at random, with probability at least 
1 — (fci + A)/fc > 0.51 we have = St_i — l. In addition, as contains at most A variables from 
Aft-i, the probabihty that H^ = H[_^ + 1 is bounded from above by \/k < 0.01. Thus, (EJ holds. 



Case 2: it e Zt-i 



As the assignment Tt-i is rich, there are at least 0.8fc indices j such that Tt{^i^j) = 
1. However, for all of these indices j we have at-ii^i^j) = 0, because is 
unsatisfied under at-i- Hence, the probability that Tt{^i^j^) = 1 and at-i{^itjt) = is at least 
0.8, and if this event indeed occurs then (Jt{^i^j^) ~ Tt{^i^j^) = 1. Therefore, Hj. — Hf_^ has 
expectation < —0.8 + 0.2 < —0.6. Moreover, S'^ < St_i with certainty. This implies ( fTTT i. 



To complete the proof, we are going to apply Azuma's inequality (Lemma 12.21 in Section |2]i to the 
random variable S'rp, + H^,. The inequality applies because (fTTl i shows that {S'f + H[)t>Q is a super- 
martingale. However, there is a minor technical intricacy: to use the inequality, we need an upper bound on 
the expectation E[S'rpt + H!p,]. But as (fTTI) onlv holds for t < min {T, T*}, this would require knowledge 
of the probability that T > T*, the very quantity that we want to estimate. 

To circumvent this problem, we define further random variables Rt by letting Rt = S'^ + H[ for 
t < min{T*,T} and Rt ^ Rt-i - 1 for t > min {T*,T}. Then i?o = and E [i?t|J't-i] < Rt-i - \ 
for all t > 0. Thus, E \Rt-\ < -T*/2. Recalling the definition Q of e, we obtain for k > ko sufficiently 
large and p < po = 1/25 sufficiently small the bound E [Rt*] < — T*/2 < —Apn/k - en — i}{n). 
Furthermore, \Rt — Rt-i\ < 2 for alH > by the definitions of S't, Therefore, Azuma's inequality 
yields 



P [i?T* > -4/97i/fc- en] < P i?r* > E[i?T*] 
Finally, we obtain from ([Sj, (|9]l, and Proposition 13 .7 1 



,2/3 



< exp 



ST* 



= o(l). 



(12) 



p[r>r*] < p[v<<r* : |A| + 2fc|Zt| + i?t >o] <p[|i?T* I + 2fc|ZT. I + i?T* >o] 

< P [\Vt' I + 2k \Zt' I > pn/k + en] + P [Rt* > -Apn/k - en] o(l), 

thereby completing the proof. □ 

Our remaining task is to estabhsh Propositions I3.4l and 13.61 From a formal point of view, we should 
start with Proposition 13.61 because the proof of Propositions 13.41 depends on it. However, the argument 
that is used in the proof of Propositions 13 .4l is conceptually similar to but technically far simpler than the 
one that we use to prove Proposition 13.61 Hence, for didactical reasons we will start with the proof of 
Propositions 13. 4l in Section|4]and postpone the proof of Proposition l3.6l to Section|5] 



4 Proof of Proposition \3A 



In this section we keep the notation and the assumptions from Proposition \3.4\ 

Our goal is to bound the number [Vt* \ of At- U Nt- -negative clauses i.e., clauses whose positive 
literals all belong to At* U Nt* ■ Thus, we need to study how the process PI0-PI3 'hits' the positions 
(i, j) e [m] X [fc] that represent positive literals by adding their underlying variable to At* U Nt* ■ To this 
end, we consider the two random variables 

- W (13) 



otherwise, 

.Of _ I 1 if 7rt_i(i, 1 and e 



* ^ '-^^ \ otherwise, ^ ' 

for any £ [m] x [k] and t > 1. Recall that TTt-i{i,j) — sign($ij) iff is a literal such that 

^ At-i U A/f-i (cf. PI3). To simplify the notation, we define for a set I C [m] x [k] 

K*t{I)= n K*t{i,jl K?{I)^ n Kn^,J). 
If I* , I" C [m] X [fc] are both non-empty, then 



(15) 



Indeed, suppose that ii'°(X'') ^ 0. Then PI2 must have added at least one clause to Zj. But the construction 
in PI2 ensures that the first clause that gets added to Zt contains the variable | ^i^j^ \ flipped at time t. Thus, 
At C At-\ by PI3, and thus there cannot be a pair with Kl(i^ j) = 0. In effect, K^{I*) = 0. 

Lemma 4.1 Let t > 1 and 9 ^ I* C [m] x [k]. Let £*{!■*) be the event that = ^ 

At-i UA/t_i, and{it,jt) ^T*. Then 

F[£:{I*)\Ft-i] < max{l,|y\(A-iUM_i)|}-l^*l. 

Proof. Since clause is unsatisfied under dt-i, is At-i U A/j-i-negative and thus 7Tt-i{it,jt) 1- 
Hence, PI3 ensures that either |4*itjJ G At-i U Mt-i or 7rt-i{it,jt) = If £ti^*) occurs, then 
l^itjt I ^ At-i U Aft-i and thus Trt-i{it,jt) = —1- Furthermore, if Z* occurs, then \ ^ At-i U Nt-i 
for all € I*, and thus 7rt_i(i,j) e {-1,1} by PI3. Thus, by Fact [33] | J and with 

G X* are independently uniformly distributed over V \ [At-i U A/j-i). Therefore, 

F[£:{r)\Tt-i] < max{l,|l/\(A-iUM-i)|}"'^*' , (16) 
whence the assertion follows. □ 

Corollary 4.2 For any t > 1, I* d [m] x [k] we have 

E[K:{r)\Tt-i] < max{l,|y\(A-iUM_i)|}-l^*l. 
Proof. If Y[(^i j")6i* ^tihj) = 1' then the event £*{!*) occurs. Hence, Lemma|4T|implies that 



E 



< P[f;(r)|J-t_i] <max{l,|\/\(A-iUA/;_i)|}-'^*l, (17) 



as claimed. □ 
Lemma 4.3 For any t > 1, St > and 1° C [m] x [k] we have 

1-7-01 



E [K°{I°) ■\{\Zt \ Zt^,\ < 5t} l-Ft-i] < 



max{l, |^\ (A-i UM-i)! - kSt} 



Proof. We may assume that I" ^ 0. We may also assume that TTt-i{i,j) = 1 for all G I" as 

otherwise /^"(i, j) = for all (i, j) e 1°. We are going to work with the conditional distribution 

p[-]=P[-\J^t-i]. 

Let be the event that = 1 and \Zt \ Zt-i\ < St- Then our goal is to estimate p . 

If the event occurs, then TTt-i{it,jt) = —1 and l^i^jj G Aft- Indeed, being unsatisfied under 
the assignment (Tt_i, clause is At-i U A/j-i -negative, and thus TTt-i{it, ,jt) 7^ 1- Furthermore, if 
■jTt-i{it,jt) = ^itjf then G At-i U M-i by P13, and thus Zt = Zt-i and Aft = Nt-i by the 

construction in step P12. But if M = Aft-i, then K^{T°) by definition. 

Thus, assume that Tit-i{itijt) = —1 and l^i^ | € A/*. We need to trace the process described in PI2 
that enhances the sets Aft and Zt- This process may add a sequence of clause indices to the set Zt and 
the variables that occur in these clauses to Aft- As these variables get added to the set Aft one by one, we 
will study the probability that they occur in one of the positions (i, j) S The first clause that P12 adds 
to Zt necessarily contains the newly flipped variable |, and thus we may assume that this is the first 
variable that gets added to A/f. In addition, if |Zf \ Zf_i| < (5t, P12 may add up to fc(5t — 1 further variables 
to Aft- To track this process, we need a bit of notation. 



Let si, . . . , Sy he the clause indices that PI2 adds to Zt, in the order in which they get added by the 
process. Let y* = min {y, 5t}- For each 1 < i < y* let 1 < < • • • < jij. < fc be the unique sequence 
of indices such that 7rt_i(si, = —1 and 

i-l 

l*s.,.J ^ {|*.,,J} UM_i U y iV(*,J U {I*..,, J -.uKq} for all q < 1,. 

This means that {l^s^j^ , | : 1 < 9 < ^i} are the new variables that contributes to Mt and that did not 
belong to At~\ already. Let — l^itjj ™d 1st • ■ • ? the sequence of variables \^siji \ with 

q= 1, . . . , and i = 1, . . . , y*. Hence, ^o, ■ • • i is the sequence of variables not in At-\ that PI2 adds 
to A/j, in the order in which the process adds these variables to Mt- By our choice of y* , the total number 
of these variables satisfies 

L + l<ky* < k6t. 

Of course, L and ^o, ■ • ■ i are random variables. 

If £° occurs, then each of the variables with G occurs in the sequence ^o, • ■ • 7 ^l- Hence, 
there exists a map / : — )■ {0, 1, . . . , kSt — 1} such that f{i,j) < L and $y = ^f(i,j) for all (i, S X°. 
For a given / let S^{f) denote this event. Then by the union bound, 

p[E'] < p [£«(/)] <(M.)I^\ max (18) 

We claim that 

p[£°{f)] < max{l,|y\(A-iUM-i)|-fc'5J-l^°l (19) 

for any /. To prove (fT9] l. let Z° = f~^{l) be the set of positions E iP where the variable occurs 
(0 < Z < L). Moreover, let £"(/) be the event that 

a. = for all (i, j) G I", and 

b. *y ^e^forall(^,J)GIO\I^'. 

As TTt-i{i,j) ~ 1 for all G I", given J^t_i the variables with (i, j) G If are independently 

uniformly distributed over V \ {At-i U A/f_i) by Fact 13.31 Hence, given the event C\^^i£f{f), the 
variables | with G 1° are uniformly distributed over the set V \ {At~i U Mt-i U {^O: • • ■ : 6-1}) 
(for if f °(/) occurs for some v <l, then 7^ for all [i,]) G Therefore, we obtain 



V<1 



< max{l,|V^\(A-iUM-i)|-^ + l} foranyO</<L 



Multiplying these conditional probabilities up for < I < L < kSt, we obtain (fT9] l. Finally, combin- 
ing ( fTSI l, ( fTSl l, and ( fT9] l completes the proof. □ 

Corollary 4.4 For any t > 1, 5t > fl«c/ 1* , C [m] x [fc] we have 

^[K;{I*)Kf{l'')\{\Zt\Zt_^\<5t}\:Ft-i] < max{l,|V^\(A-iUM-i)|}"'^'' 

fc(5. 



^max{l, \ (A-i UM-i)| - fc^t}. 

Proof. This is immediate from ( fTSl l and Corollary 14.2 l and Lemma |43] □ 
We will now establish the following. 

Proposition 4.5 W.h.p. we have either Zt* > en or \'Dt* \ < 



Proof. Let £ be the event that |Zt- | < en but I^t- | > ^^"^m. Our goal is to show that P [£] = o(l). To 
this end, we will decompose £ into various 'sub-events' that are sufficiently detailed for us to bound their 
probabilities via Corollary 14.41 In order to bound the probability of £ we will then use the union bound. 

As a first step, we need to decompose £ according to the sequence {\Zt \ Zt_i|)t>i of increments of 
the sets Zt. More precisely, let A be the set of all sequences 8 = {5t)i<t<T' of non-negative integers with 
Ef=i < en. Let £{S) be the event that \Zt \ Zt-i \ < St for all 1 < t < T* and |I?t- | > 2'^^''m. If the 
event £ occurs, then there is a sequence S such that the event £{S) occurs. Hence, by the union bound 

P[£]<^P[f(^)]<|A|.maxP[£(<5)]. 

SeA 

As it is well known that |A| = ("-^'^^i*) = (Tr^K*)' "^^ 

Fixing any sequence (5 e A, we now decompose the event £{6) further according to the precise set M 
of clauses that end up in Vt' , and according to the precise 'reason' why each clause i E M belongs to 
2?T* ■ More precisely, let M C [m] be a set of size /i = 2^~^nn. Moreover, for disjoint Q* ,Q''^ C AI x [k] 
let £oiQ*,Q°) be the event that 

Mhj) = 1 for (i, j) eQ*U Q°, while 7ro(i,i) = -1 for £ M x [k] \ (Q* U Q°). 

Furthermore, for maps t* : Q* ^ [T*],t° : g" ^ [T*] let 5(5, t*, rO) be the event that |Zt \ Zt_i| < St 
for all 1 < i < T* and 

T^r'{i,j)-i{iJ) = 1 while *y e for all e Q* , 

7i"rO(ij)-i(«, j) = 1 while *y e Mt«{L]) for all (i, e Q°. 

If the event £{S) occurs, then there exist Q*,Q'^ and t*,t° such that the events £o{Q*,Q'^) and 
£{6,T* ,t'^) occur In fact, if £{S) occurs, then |I>t*| > Thus, select a subset M C I?t* of size 
/i. By the definition of 2?t' , each i G M is At* U A/t* -negative. Thus, for any j G [k] such that is 
a positive literal there is a time 1 < ^ = t{hj) < T* such that 7rf_i(i, j) = 1 but TTt{i,j) £ AtU Mt. 
If Trt{i,j) E At(i.j), then include (i, j) in Q* and set T*{i,j) = t. Otherwise, add {i,j) to and let 
T^ih j) = ^- Then indeed both £q{Q* , Q") and £{6, t* , r°) occur Thus, by the union bound, 

P[£iS)]< P[£oiQ*,Q")n£i5,T*,T°)]. (21) 

The event £o{Q* , Q") depends only on the signs of the literals and is therefore J'q -measurable. Fur- 
thermore, as signs of the literals $y are mutually independent, we get 

P[£o{Q*,Q°)] =2-'=r 

Therefore, (|211 yields 

P [£:(<5)] < 2-^^ J2 P[£{5,T*,r°)\:Fo]. (22) 

Thus, we are left to estimate P[£{S, t*,t°)\To] . 

We defined the random variables K^{-, •), K^{-, ■) so that if the event £{S, r*, r°) occurs, then 

n Kr'i^^J){^,J)■ n K,o^^.J){hJ)■l[^{\Zt\Zt-l\<St}^l. 



In order to apply Corollary l4.4l to the above expression, we are going to reorder the product according to 
the time parameter More precisely, let = t* ^^{t) and = r° (t). Then 



P[£:(5,T*,T°)|J-o] < E 



E 



n Kr-H^Mhj) n K,0(^,^^)il,j)l[l{\Zt-Zt-l\<dt}^l\To 

l[KtiQ*t)KtiQ"t) ■lllZtXZt-il < St} = i|J-o 



.t=l 



If \Zt \Zt-i\< 5t for all t < T\ then IM-i | + kSt < k J2,<t St < ken for all t <T*. Furthermore, 
\At\<t<T* = O.ln/fcforalH > 0. Hence, \V\ {At^iUM-i)\~ kSt > 7i(l - fee- 0.1/fc) > n/1.01 
for all t < T*, provided that fc > fco is large enough. Thus, Corollary 14.41 entails in combination with 
Lemma l2.1l 



(23) 



(ij)eQ° 

For any M C [m] of size /i and any two disjoint Q* , Q'^ C M x [k] let 



5(M,Q*,g")= 



1.01 



n 



with the sum ranging over all maps t* : Q* [T*], t° : ^ [T*]. Let 9 = T*/n. As Et<T* < en, 
we obtain 



5(A/,Q*,Q0) < 



I.OIT* 



I.OIT' 



IQ*I 



\Q'\ 



l.Olfc 



l.Olfc 



, IQ"l 



\Q"\ /T 



Y^St] < (1.010)1^*1 (l.Olefc)"^ I . (24) 



Combining (l22l i. ( l23T l. and (l24l i. we thus get for any ^ e A 

P[£{d)] < 2-^^ E S{M,Q*,Q') 

MC[m]:|A/|=MQ':Q°CMx[fe]:Q'nQO=0 



< 2 



— kfj. 



< 2 



< 



Y E (l.OW)'* (l.Olefc)'" 

9*,90:g*+gO<fc/iQ*,Q«:|Q*|=gMQO|=g" 

k^L 



E 

^1 + im{e + imke)\ 



g*, kfi — q* — 



J(1.016l)'' (l.Olfe)'^ 



) 



< 



cm 



1 + 1.01(61+ 1.01fc£) y 
2 j 



< 



62" 



fc_2 A + 1.01(0 + 1.01A:e)y 



(25) 



provided that k > ko is sufficiently big. Finally, combining (|20] | and (|25] |. we obtain 



F[£] < 



en 



en 



(I) <Hl + e/e)rQ . (26) 



By our assumption that p > k (cf. the first paragraph in Section |3), we have ^ ~ 2^ > pn/k > 
k~^n. Hence, recalling that 6 < 1/k and e = cxp(— fc^/'^) (cf. Q), we obtain from ( 



P[£] < eyii>[n{e\n{2/e)-k-'^)] <cxp n(keyi-p{-k^/^)-k-^lii{c/3)j = cxp(-rj(n)) = o(l), 



provided that k > ko is sufficiently large. 

Finally, Proposition l3.4l is immediate from Propositions 13 . 61 and l4.5l 



□ 



5 Proof of Proposition 1X6 



Throughout this section we keep the notation and the assumptions of Proposition 

5.1 Outline 

The goal in this section is to bound the size of the set Zt- ■ There are two reasons why step PI2 may add 
a clause index i G [m] to the set Zt for some 1 < t < T* . First, the clause 4>i may feature at least fci 
variables from the set At-i U{\^i^j^ |}, i.e., variables that have been flipped at least once. Second, $i may 
contain at least A variables that also occur in clauses that were added to Zt previously. The key issue is to 
deal with the first case. Once that is done, we can bound the number of clauses that get included for the 
second reason via Lemma l24l i.e., via the expansion properties of the random formula. 

Thus, we need to investigate how a clause comes to contain a lot of variables from At-i U { l^itjj |} 
for some t < T*. There are two ways in which this may occur First, Walk sat may have tried to satisfy 

'actively' several times, i.e., is = i for several s < t. Second, $i may contain several of the variables 
l^isjJ flipped at times s < t 'accidentally', i.e., without Walksat trying to actively satisfy i. More 
precisely, for any t > we call a pair G [m] x [fc] 

• t-active if there is 1 < s < t such that = {is,js) and TTs-i{i,j) = —1. 

• t-passive if there is 1 < s < i such that ^ (*s,is) but ~ \^isjs\ ^i^^ 7Ts-i{i,j) G 



Furthermore, we say that i e [m] is t-active if there are k2 — 0.48A: indices j such that is t-active. 
Similarly, we say that i is t-passive if there are ^3 = O.Olfc indices j such that is t-passive. These 
definitions ensure that any i G [m] for which there are at least fci indices j G [k] such that | G 
At-i U {l^itjj} is either t-active or t-passive. 

To prove Proposition 13.61 we will deal separately with t-active and t-passive clause. Let At be the 
number of t-active clauses, and let Pt be the number of t-passive clauses. 

Lemma 5.1 For any 1 <t <T* we have P [At < en/4 V \Zt-i \ > en] > 1 - l/n^. 

We defer the proof of Lemma ISTI to Section l572l 

Lemma 5.2 For any 1 <t<T* we have P [Pt < en/A V |Zt_i| > en] > I - \/n^. 
Proof. As in the proof of Proposition |43] we are going to break the event of interest, i.e.. 



down into sub-events whose probabilities can be estimated via Lemma |4T| Then we will use the union 
bound to estimate the probability of E. 

For a set M C [m] of /i = en/4 clause indices let £{M) be the event that \Zt-i[ < en and all i G M 
are t-passive. If £ occurs, then there is a set M such that the event £{M) occurs. Hence, by the union 
bound 



Thus, fix a set M C [m] of size /i. Let Q C M x [k] be a set such that for each i G M there are 
precisely k^ indices j G [fc] such that (i, j) G Q. Let £{M, Q) be the event that [Zt-i | < en and all pairs 



{-1,1}- 



£ = {Pt>en/4A[Zt^i[ < en} , 




(27) 



€ Q are t-passive. If the event £{M) occurs, then there exists a set Q such that £{M, Q) occurs. 
Therefore, again by the union bound 



(28) 



P [£{M)] < ^ P [£iM, Q)] < " max P [f (A/, Q)] . 

For a map r : Q — > [f] let £{M, Q, r) be the event that \Zt-i\ < en and 

T{i,j) — niinjs G [t] : is s-passive} for all G Q. 

If the event £{M, Q) occurs, then there is a map r such that the event £{M, Q, r) occurs. Consequently, 
for any M, Q we have 



V[£{M,Q)] < ^P[f(M,Q,T)] <<l'5lmaxP[f(A/,g,r) 

r 

Combining dZTl l. ( l28T l, and ( |29] |, we see that 



P[Pt>Ai] < 



m\ ( k 



max V[£{M,Q,t)] 



M.Q.T 



(29) 



(30) 



Hence, fix any M,Q,t. Let Qs = t ^{s) for any I < s < t, and let £*{Qs) be the event that 
| = ^ A-i U7Vt_i for all e Qs, and {it,jt) ^ Qs- If £{M,Q,t), then the events 

£*{Qs) occur for all 1 < s < i. Moreover, the construction PI0-PI3 ensures that \As\ < s, and that 
|A/'s-i| < fc|Zs_i| < fcenforall 1 < s < ^- Therefore. Lemma l4.1 [ implies 



P[£:(A/, Q,t)] < P 



f]£:iQs)n{\Ms-i\<ken} 



< J|max{l,n-s + l-fc£n} '^^^'(31) 



As s < ^ < 2^* < n/k, e ^ cxp(— fc-^/'^), and because we are assuming that k > ko is sufficiently large, 
we have n — s + 1 — ken > n/1.001. Hence, dSTT i yields 



P[£{M,Q,t)] < ]^max{l,n-s + l-fen}"''^''' < (1.001/n) 



(32) 



Finally, combining dSOb and ( l32b and letting 6' = T* /n, we get 



m 



em 



LOOlefceiy 



< 



As 6 = 0.1/A:, we have l.QQlckO < 1. Since furthermore k^ ~ fc/lOO, we obtain for k > ko large enough 



P[£] < 



ek 



< exp{-fi) = o(l), 



thereby completing the proof. □ 

Proof of Proposition 15.61 In order to bound \Zt\ for < t < T*, we are going to consider a superset 
yt D Zt whose size is easier to estimate. To define yt, we let y^ be the set of all i that are either t-active 
or f-passive. Now, yt is the outcome of the following process. 

Initially, let =3^;. 

While there is a clause i e [m]\ yt such that \{j e [k] : e N{^yJ}\ > A, add i to yt. 
Comparing the above process with the construction in PI2, we see that indeed 

yt^Zt. (33) 



Also note that yt D 3^t-i for alH > 1. 

To bound \yt\, we proceed by induction on t. Let Yt be the event that either the random formula ^ 
violates the property Q, or |3^f | > en. We claim that P [Yq] = o(l) and that 

P [Yt] < P [Yt-i] + 2n-^ for all 1 < t < T* . (34) 

Since trivially J^o = 0- is simply the event # violates ([3). Hence, Lemma l24l shows directly that 

P ^o] = o(l). (35) 

Now, consider some 1 < t < T*. Lemmas |5 . 1 1 and \52\ show that 

P[At + Pt < en/2V \Zt-i \ > en] > 1 - 

Furthermore, if Yt-i does not occur, then we know that \Zt-i\ < |3^t-i| < sn and that (O is satisfied. If 
in addition At + Pt < en/ 2, then Q ensures that |3^t| < sn, and thus Yt does not occur Therefore, 

P [Yt] = P [Yt-i] +P[Yt\ Yt-i] < P [Yt-i] +P[At + Pt> 2e^n A [Zt-i[ < en] < P [Yt^i] + 2/n\ 

Finally, dUi and ^ yield 

T* 

P [|3^T- 1 > en] < P [Yt.] < P [Yq] + 2/n^ = o(l) + 2T*/n^ = o(l). 
In combination with ( l33T l, this implies the assertion. □ 



5.2 Proof of Lemma 15.11 

How can a clause $i become t-active? If this occurs, then Walksat must have tried 'actively' to satisfy 
at least k2 times by flipping one of its variables. But each time, the variable that Walksat flipped to 
satisfy got flipped again because flipping it rendered another clause unsatisfied. 

More precisely, if #i is t-active, then there exist distinct 'slots' ji, . . . , jk^ G [k] and times si , . . . , s^^ £ 
[t] such that is s;-active for I = 1, . . . , ^2. This means that at the times s;, Walksat actively tried 
to satisfy #i by flipping j^y, | (L = 1, . . . , fe2)- However, as Walksat had to make k2 attempts, each 
of the variables | with I < k2 must have been flipped once more by time s;+i. Hence, [^ij^ [ occurs 
positively in a clause that is unsatisfied at some time si < qi < s;+i. In particular, hi G I?g, C Vt- 

Thus, in order to prove Lemma ISTI we are going to bound the probability that there are at least en/4 
clauses $i that admit ji, . . . , G [k] such that for each 1 < I < k2 there is another clause with the 
following properties. 

Al. We have sign($y, ) = — 1, and there is an index j € [k] such that sign($/i,j) = 1 and ^hij = [^iji I- 

A2. hi <E Vt, i.e., is At U A/j-negative. 

In order to deal with Al we will need to refine our filtration. Given a subset Q C [m] x [k] and a map 
g : Q ^ [to] X [k] , we let ilg be the event that 

sign(*,y) = -1, sign(*g(,_j)) = 1 and = |*g(»,j)| foraU € Q. 

Since the literals of the random formula # are independently uniformly distributed, we see that 

P[f^3] <2-l«^s(Q)l„-|QI. (36) 

We consider Vlg as a probability space equipped with the uniform distribution (in other words, we are going 
to condition on D,g). Further, we define a filtration {J-g.t)t>o on Vtg by letting Fg^t = H fig : £ e Ft}- 
In other words, Fg^t is the projection of Ft onto ilg. Hence, Fact l3.2l directlv implies the following. 



Fact 5.3 For any t > 0, any x & V, and any i € [to] the events {at{x) = 1}, is satisfied under (Jt\, 
{x G At}, {i G Zt\, {x G Mt\, and {T = i\ are Tgj-measurable. 

Moreover, since the only conditioning we impose in ilg concerns the Uterals $y with (i, j ) G QUg{Q), 
Fact [33] yields the following. 

Fact 5.4 Let t > 0. Let £t be the set of all pairs G [to] x [k] \ (Q U g{Q)) such that T^tihj) G 

{—1,1}. The conditional joint distribution of the variables {\^ij\){i.j)e£t given Tt,g is uniform over 
{V\ {AtVNt)Y*. That is, for any map f : £t ^ V \ {At U Mt) we have 

p [V(z,j) G Et ■■ = /(^,j)l-^t,<,] = (A uM)|-i^'i. 

Similarly, with respect to the random variables K^{-, •) and •) defined in ( fT3l l and flAi . Corol- 

lary |43]implies the following. 

Corollary 5.5 For any t > 1, St > and I* , 1° C [m] x [k] \ (Q U g{Q)) we have 

E [k: {r)K° {\Zt \ Zt-i\ < StjlJ'g.t-i] < max{l,|y\(A-iUA/;_i)|}-l^*l 

/ kSt 

■ \ma.x{l,\V\iAt-iUAft-i)\-kSt} 

As a further preparation, we need the following lemma. 

Lemma 5.6 Let 1 < t < T*. Set ^ = en/4 and let M C [to] be a set of size \M\ = fi. Furthermore, let 
Q <Z M X [k], let I C [to] be a set of size \I\ < \Q\, and let g : Q ^ I x [k]. Let £{M, Q, I, g) denote the 
event that \Zt\ < sn and the following three statements hold. 

a. For all (i, j) Cz Q we have sign($ij) = —1, sign($g(j ) ~ 1, and ^g(^i j^ = \. 

b. Id Vt. 

c. For each i Cz I there is j G [k] such that {i, j) G g{Q)- 
ThenP[£{M,Q,I,g)] < 2(7,+-) (2n)-IQl2-^l^l . 

Proof. To estimate P [£{M, Q, /, g)], we need to decompose the event £(M, Q, I, g) into 'more detailed' 
sub-events whose probabilities can be bounded directly via Corollarv l5.5l To this end, let I* , 1° be two dis- 
joint subsets of /x [/s]\g(g), and let T -.1* [T*],t° : J°-)^ [T*]betwomaps. Let£:(M,Q, J,g,r,t°) 
be the event that |Zt | < en and that the following statements are true. 

a. For all G Q we have sign(*jj) -1, sign(*g(j^j)) = 1, and = 

b. i. If G / X [A:] \ (.g(Q) U I* U 1°), then sign(*ij) = -1. 

a. If (i, j) G I*, then sign(*,y) = TTf(ij)-i{i,j) = 1 and *y G Af(ij). 
Hi. If (i, j) G 2:°, then sign(*y) ^ ■^tO{i,j)^i{h j) = 1 and G AftO[i,j). 

c. For each i G / there is j G [fc] such that (i, j) G g{Q)- 

If the event £{M, Q, I, g) occurs, then there exist 1*,!°, T, i° such that the event £{M, Q, I, g, t*, t°) 
occurs. Indeed, the definition of the set Dt* is such that if i G Vt* , then for any (i, j) Cz I x [k] such that 
sign($ij) = 1 we have G At' U A/t* . Thus, by the union bound, 

F[£{M,Q,I,g)] < ^ ^P[£(A/,Q,/,.g,r,tO)] . (37) 

X*,X° t',tO 




Furthermore, let S = {Sq, . . . ,St) he a sequence such that X]*=i < Let £{S, M, Q, /, g, <*, 
be the event that \Zs \ Zs-i\ < Ss for all 1 < s < t and that £{M, Q, I, g, t* ,t^) occurs. Then by the 
union bound. 



P[f(il/,Q,/,5,r,tO)] < ^P[£(^,A/,Q,/,5,r,tO)] 



< 



^* + maxP [£{8, M, Q, I, g, t*,t') 



en - 



(38) 



The event £{5, M, Q, I, g, t*,t^) is sufficiently specific so that we can estimate its probabiUty easily. 
Namely, if £{6, M, Q, I, g, t*,t^) occurs, then ilg occurs and 



n I<tH^,J)i'^J) n Kt,^.,,)it,j)Y[i{\zs\zs-i\<s,}=i. 



(39) 



i5=l 



To bound the probability that (|39] | occurs, we reorder the product by the time parameter That is, letting 

I* =t*-^{s),l'^ = i"-i(s),we get 

P[£:(^,M,Q,/,.9,r,iO)|J-3,o] 

t 

< E n n Kt.^,^,^{t,j)'[[l{\Zs\Zs.i\<6.} = l\J^g^o 

(i,i)ei* (i,i)ei" s=i 

t 

n A7(i:)A',"(Z,")l {|Z, \ < 5,} I . (40) 



< E 



Since for any s < t < T* we have \As\ < s < T* < O.ln/fc, and as lA/^l < fc^^g^i Ss < ken, we see 
that \As UMs\ + kSs < O.OOln for all s < t. Hence, (00)1 and Corollary |53]yield 

F[£(5,M,Q,I,g,t*,e)\J^,.o] < IT (^) ' (^) 

- n ^^^-c^.)- (41) 

Furthermore, if the event £{S, M, Q, I, g, t*,t°) occurs, then for all (i, j) e I x [k] \ {g{Q) U I* U 1°) 
we have sign($ij) = —1. This event is J"o,g -measurable. Hence, as the signs of the literals are 
independently uniformly distributed, we obtain from (|411 



I I _|_ 12^*^ I 

V[£{d,M,QJ,gX,n^,] < 2-l^xW\(«('3)u^-^^°)l (i^) J] ^^^^(^J) 

|I*UI"| 



(i,j)eio 



Combining (l36T l and (l42T i. we get 



P[f(5,A/,g,/,.g,r,t")] = P[l]<,]P[£:(5,M,g,/,g,r,tO)|f] 



9J 

< (2n)-l'2l2-l^xWl(^)'^"^' n kS,o^^.)- (43) 



As i37\ and ( |38] | show, in order to obtain P [£{M,Q, I, g)], we need to sum (l43l l over all possible 
choices of 5,I*,I^r,i°: 

P[£iM,Q,I,g)] < ri\'^)i2n)-^'^^ E E ? ('5, M, Q, /, 5, i*, O] 

^ QT)p»)-""^-"'""i:(^)"'""'''^'' e n *^.-...) 



< 



T* + en 
en+ 1 



because X)s=i '^^ — '■^^ l^^'- ^'^'^ decays geometrically with respect to \2* U 1° |, we get 

P[£(A/,g,/,g)] < 2r*+7V2n)-l«l2-l^xWI, 
\ en+ I ) 

as desired. □ 

Proof of Lemma 1377] Let ^ = en/4 and fix some \ < t < T* . Let £" be the event that \Zt\ < en and 
At > n. For a set Af C [m] of size |A/| = /i we let f (Af) signify the event that all clauses i g Ai^ are 
t-active. If £ occurs, then there is a set A/ of size /i such that £{M) occurs. Hence, by the union bound 

P[f] < P ['^'(^^)] < (^™^ maxP (M)] . (44) 



MC[m]:\M\=fi 



To bound the expression on the rh.s., fix some set A/ C [?n] of size /i. Let Q(A/) be the set of all 
Q C A/ X [A;] such that for each i e Af we have | {j e [k] : (i, j) e Q} | > fcz - 1. For a set Q e Q(A/) 
let £{M, Q) be the event that \Zt \ < en and 

a. all pairs {i,j) £ Q are s(i, j) -active for some s{i,j) < t, and 

b. for each i e A/ there is j' E [k] such that {i, j') is s-active at some time s satisfying 

max s(i, j) < s < t. 

If the event £(A/) occurs, then there exists Q e Q(A/) such that £(A/, Q) occurs. (In fact, if £{M) occurs, 
then by the definition of t-active, for any i E M there are at least /c2 indices j such that (i, j) is s-active for 
some s < t. We can thus let Q contain the pairs (i, j) for the 'earliest' fc2 — 1 such indices j.) Hence, by 
the union bound 

P [£{M)] < ^ P [£{M, Q)] < f \ \ maxP [£iM, Q)] . (45) 
QeQ ^ ^ ^ 

Now, fix a set M C [to], \M\ = /i, and a set Q e Q{M). If the event £{M, Q) occurs, then there exist 
I, g such that the event £{M, Q, I, g) as in Lemma I5l6l occurs . Indeed, this is precisely what we pointed 
out in Al, A2 above. Thus, by the union bound 

(fe2-l)M 

P[£{M,Q)] < YP[£{M,Q,I,g)]< E E E P [£iM,Q,I,g)] 

I,g y=l /C[ml:|/|=i/g:A/^/x[fc] 

< ^E , , max V[£{M,Q,I,g)]. (46) 



According to Lemma [531 



(47) 



Combining (l46T l and ( |47] |. we obtain 



^ <r„:T)(^")-'«'E"(9)"('=^)-'- 



{k2-l)fJ. 



T* +en 



iy=l 



Since the largest summand is the one with u ~ {k2 — 1)m \Q\ — (^2 — 1)//, we obtain 

Plugging (|48]) into (l45T l, we get 



(48) 



(49) 



Finally, (HHi and yield 



< 2fc^ 

< 2fc^ 



T* + en 
en + 1 

T* + en 
en + 1 



4e2''p 

ke 



(2.13ep) 



(fc2-l) 



(50) 



If P < Po = 1/25, then (2.13ep)'''"~^^ < exp(-/c2/100) for A: > fco large enough. Hence, ^ 
yields for k > ko large enough 



rn+l 



P [£] < 2kfi {^* ^ I cxp(-fc2p/2) < 2fc^ ( ^(^* + ) exp(-fc2M/100) 
£7^ + 1 / \ en ' 



< 2kfi 



< exp [2eri, — en Ine — fc2/^/100 + o(n)] 

< exp[n(2£-eln£-fc2e/400 + o(l))] 

< exp[-nfc2£/401] = 0(1), 



[by our choice of /i] 

[by our choice of e, cf. ©J. 



as desired. 



□ 
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